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Abstract 

We propose a novel method of community detection that is computationally inexpensive and pos- 
sesses physical significance to a member of a social network. This method is unlike many divisive and 
agglomerative techniques and is local in the sense that a community can be detected within a network 
without requiring knowledge of the entire network. A global application of this method is also introduced. 
Several artificial and real-world networks, including the famous Zachary Karate club, are analyzed. 

1 Introduction 

It has been shown in the past that many interesting systems can be represented as networks composed of 
vertices and edges ^ [21 El ^ • Such systems include the Internet 5 , social and friendship networks [H] , food 
webs and citation networks (HHHj. For example, a social network may represent people as vertices and 
edges linking vertices when those people are on a first-name basis. 

A topic of current interest in the area of networks has been the idea of communities and their detection. A 
Community could be loosely described as a collection of vertices within a graph that are densely connected 
amongst themselves while being loosely connected to the rest of the graph |101 111! IT^ ] . Many networks 
exhibit such a community structure and this motivates our work. This description, however, is somewhat 
vague and open to interpretation. This leads to the possibility that different techniques for detecting these 
communities may lead to slightly different yet equally valid results. We emphasize this variation in Section 

Several techniques have been proposed to detect community structure inside of a network. The recent 
and highly successful betwcenness centrality algorithm due to Newman and Girvan ^] ^] performs 
well within a variety of networks but it is costly to compute (0(n 2 m) on a graph with n vertices and m 
edges) |15| . More importantly, while betweenness centrality has been shown to be a useful quantity for 
detecting community structure, it is knowledge not usually attainable to a vertex within the graph. 

In this paper we ask, if a person were to move to a new town, what actions would he or she take to 
see what community or communities they belong to? Most community detection methods using hierarchical 
clustering fall within two categories: divisive and agglomerative [HI Both forms, including those using 
betweenness and other methods, are global algorithms and don't represent feasible actions that a member of 
a network could undertake to identify the network's community structure. The method proposed here may 
better represent actions that members of a network would undertake to identify their own communities. 



2 The Algorithm 

The proposed algorithm consists of an Z-shell spreading outward from a starting vertex. As the starting 
vertex's nearest neighbors and next nearest neighbors, etc. are visited by the Z-shell, two quantities are 
computed: the emerging degree and total emerging degree. The emerging degree of a vertex is defined as the 
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number of edges that connect that vertex to vertices the /-shell has not already visited as it expanded from 
the previous 1 — 1,1 — 2, ... -shells. Note that edges between vertices within the same /-shell do not contribute 
to the emerging degree. 

Let us define the following notation for the emerging degree and total emerging degree: 

fcf (j) = emerging degree of vertex i from a 
shell started at vertex j. 
Kj = total emerging degree of a shell of 

depth / starting from vertex j. (1) 

The total emerging degree of an /-shell is then the sum of the emerging degrees of all vertices on the 
leading edge of the /-shell. This can also be thought of as the total number of emerging edges from that 
/-shell We see that the total emerging degree at depth I is not necessarily the number of vertices at 
depth / + 1. At depth 0, the total emerging degree is just the degree of the starting vertex. At depth /, it is 
the total number of edges from vertices at depth / connected to vertices at depth > /. 

It follows from Eqn. ^tha^ 1 

K$ = ^ 



k j = E ** tf) ( 2 ) 



where Sj is the leading edge of the /-shell, that is, the set of all vertices exactly / steps away from vertex j. 
In addition, let us define the change in total emerging degree: 

AK > - St (3) 

K 3 

for a shell at depth / starting from vertex j. 

The algorithm works by expanding an /-shell outward from some starting vertex j and comparing the 
change in total emerging degree to some threshold a. When: 

Aifj < a (4) 

the /-shell ceases to grow and all vertices covered by shells of a depth < / are listed as members of vertex j's 
community. 

We describe our algorithm roughly as follows. For a starting vertex j: 

1. Start an /-shell, / = 0, at vertex j (add j to the list of community members) and compute K® . 

2. Spread the /-shell, / = 1, add the neighbors of j to the list, and compute Kj 

3. Compute AKj. If AKj < a, then a community has been found. Stop the algorithm. 

4. Else repeat from step 2 for the next /-shell, until a is crossed or the entire connected component is 
added to the community list. 

See Algorithm ^ for more exact pseudocode. 



Since there tend to be many interconnections within a community, so definied, as an /-shell grows outward 
from some starting vertex within a community, the total emerging degree will tend to increase. See Section 
13 for more discussion of an idealized graph model with community structure. When the /-shell reaches the 
"border" of the community, the number of emerging edges will decrease sharply. This is because, at this 
point, the only emerging edges are those connecting the community to the rest of the graph which are, by 
definition, less in number than those within the community. 
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By introducing a single parameter, a, and monitoring AiiTj, the {-shell's growth can be stopped when it 
has covered the community. It is this fact that allows for the starting vertex to detect its community locally: 
at the last depth before a is crossed, it does not matter where the emerging edges lead. See Section |2~TI for 
results using our purely local method. 

Our method is not perfect, however, and it is possible for the Z-shell to "spill over" the community it 
is detecting. This is dependent on how the starting vertex is situated within the graph: if it is closer (or 
equally close) to some non-community vertex or vertices than to some community vertices, the {-shell may 
spread along two or more communities at the same time. To alleviate this effect, one can run the algorithm 
N times, using each vertex as a starting vertex, and then achieve a group consensus as to which vertices 
belong to which communities. This idea is discussed in Section 12.21 



Algorithm 1: Local algorithm to determine a starting vertex's community 1 . Note that emerging is 
a function of the {-shell and the graph that returns the total emerging degree. 

s G V; // s is the starting vertex 

K d ~ x < — T 

Q < — empty queue; // search queue 

C < — empty queue; // community queue 

enqueue s — ► Q; 

< — emerging (Q, C, G(V, E)); 



Ks 1 
while AKf > a do 

K d-1 K d. 

foreach q G Q do 

dequeue q < — Q: 

enqueue q > C; 

enqueue neighbors(q) — ► Q; 
end 

K* < — emerging (Q, C, G(V, E)); 
AK d , Kf . 

end 



The idea of having an expanding Z-shell encompass a community is not in itself new here. The hub-based 
algorithm in JSj expands multiple Z-shells simultaneously from the n vertices of highest degree (the hubs) 
until all vertices are within an {-shell. While computationally inexpensive, this method has the following 
drawback: the number of communities detected is arbitrarily pre-assigned and the algorithm neglects the 
possibility of having two hubs within the same community. In addition, it requires knowledge of the entire 
graph; it is a global algorithm, not local. 

2.1 Select Local Results 

We have applied Algorithm the Zachary Karate club as shown in Figure Q] Figure shows the actual 
split the club underwent. We complete two runs, one starting from vertex 17 and another from vertex 24. 

^ote that this algorithm differs slightly from the one outlined in the text of Section|5] In the pseudocode we have essentially 
chosen a KJ = 1 to seed the algorithm. This has an impact on a: if a is larger than the degree of the starting vertex, then the 
/-shell's growth will terminate immediately and the result will be a singleton community. Throughout the paper, we have used 
the algorithm described in the text: Aif^ is the first value compared to a and requires gathering both JsT? (which is just the 
degree of vertex j) and Kj. When starting the algorithm in this way, the Z-shell will always spread to the immediate neighbors 
of the starting vertex before the change in total emerging degree is compared to a. This results in all runs listing the neighbors 
of the starting vertex as member's of that starting vertex's community, regardless of a. This only impacts the results for large 
a (which usually lead to a final result of N singleton communities anyway) and has no effect on the results presented here. 
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Figure 1: Two local results on the Zachary Karate 
Club with a — 1.9. Boxes and diamonds represent 
the output of Algorithmf^when starting from vertex 
24, while circles and diamonds represent the output 
when starting from vertex 17. 



Figure 2: Actual breakdown of the Karate 
Club. [Ej 



Five vertices (3, 9, 14, 20, and 32) were claimed by both runs as members of that starting vertex's community. 
Please note that this graph and all subsequent graphs and dendrograms were drawn using 17 . 

We interpret our results as follows. The five vertices that are listed as members of both starting vertex's 
communities tend to fall on the "border" between the two groups. This makes sense to us since each vertex 
is linked roughly equally to both communities. One can imagine these five members had the most difficult 
choice to make when the club split. Far from being an unwanted result, this overlap could be used to predict 
vertices that may be more likely to switch communities in the future (in an evolving network) or which 
vertices are least isolated within a single community. 

2.2 Obtaining Global Information 

Algorithm ^ is a method for a single vertex to determine something about its community membership. 
It seems reasonable that, by surveying all the locally-determined membership listings, one should be able 
to generate an idea of the global structure of the network. Here we propose a simple method using a 
membership matrix to obtain such a picture and to overcome membership overlap (discussed in Section l2~T|) 
when determining a "consensus" partitioning of the network. 

For any given starting vertex j, Algorithm^can return a vector, Vj of size N, where the ith component is 
1 if vertex i is a member of the starting vertex's community and otherwise. These vectors can be assembled 
to form a N x N membership matrix, 

M = (v 1 |v 2 |...|v JV )* (5) 

where the j'th row contains the results from using vertex j as the algorithm's starting point. This allows for 
a good way to visualize the resultant data when starting the algorithm from multiple vertices. 

We define a Distance between rows i and j of the membership matrix as the total number of differences 
between their components: 

n 

Distance^, j) = n - 5 (M ik ,M jk ) (6) 

k=l 

where S (M$fc, Mj&) = 1 if Mjfc = Mj k and otherwise. 

Now we perform a simple sorting algorithm on M. For the ith row: 
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Figure 3: Membership matrix M for the Zachary Figure 4: Sorted membership matrix M for the 
Karate Club before sorting, with a = 1.2. Red Karate Club , with a = 1.2. 
boxes indicate a value of 1, blue 0. 

1. Find Distance^, j) for all rows j > i. 

2. Pick the row that is the smallest Distance to row i (call it row k) and interchange it with row 
z + l. This requires swapping rows i + 1 and k and swapping columns i + 1 and /c. Columns are 
swapped because a row interchange is equivalent to a renumbering of the involved vertices, so that new 
numbering must be kept consistent throughout M. 

3. Repeat for row i + 1. 

Unfortunately, the sorting step can be computationally expensive. Finding Distance^, j) costs O(N). 
When the sorting algorithm begins at the first row, there are N — 1 Distances to find, so the cost of the first 
sort is 0{N{N - 1)) ~ 0{N 2 ). This is then repeated for the next row, costing 0(N(N - 2)) ~ 0(N 2 ), and 
so on for each additional row. Since there are N rows, the total cost is: 

N ( 1 \ 

^N(N -i) = N (N 2 - -N (N + 1)J =0(N 3 ) (7) 

The result of this sorting/renumbering is a membership matrix that is much more indicative of structure. 
Specifically, we have a sorted membership matrix, 

M = P l MP (8) 

where P is a permutation matrix effectively resulting from the above. Well-separated communities appear 
as blocks along the diagonal, and imperfections within the blocks can indicate substructure (See Figures 
and EJ| . 

2.3 Finding a hierarchy of sub-communities 

Sorting the membership matrix already provides a useful means of visualizing the results of all the different 
runs of the local algorithm, but this is not enough to determine how any present sub-communities relate to 
larger communities. Therefore, here we introduce a further operation to apply to M to generate a dendrogram 
of the community structure. 
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For row i, we compute a cumulative row distance, CD,;: 



CD! 

CD, 



Distance^, i — 1) + CD;_i 







^Distance(j — 1, j) 

i=2 



(9) 



Plotting the row number i versus the cumulative distance CD^ will yield a collection of points of increasing 
value falling into discrete bands that indicate the members of each community. Note that the row number i 
is the new sorted number i for that vertex: one needs to keep track of all the individual sorting operations 
to maintain the original number of that vertex, that is, through the permutations P. This step of finding 
these cumulative distances costs ~ 0{N 2 ) operations. See Figures 151 1131 IT71 and 1201 for plots of examples 
of these cumulative row distances for various networks. These plots are useful for visualization but are not 
strictly necessary to get the sub-community hierarchy. 

Finally, to yield a dendrogram of the community structure, the following operation is performed: 

1. d< — 1. 

2. Compute Distance^ — for all i = 2..n. 

3. Choose the smallest Distance (often zero for identical rows) and call it D m i n . 

4. Cd < — empty queue. // clustering queue 

5. enqueue 1st vertex — > Cd- 

6. FOR i = 2. .n : 

6.1. IF Distance^ — > D m i n : 

6.1.1. d< — 

6.1.2. Cd < — empty queue. 

6.2. enqueue ith. vertex — > Cd- 

7. Repeat from 3 for next smallest Distance until all Distances have been used. 

Essentially, we are moving down the rows of M and grouping together all the vertices whose corresponding 
rows are closer together than D m in until we arrive at a row that is farther away than D m i n . Then we start 
a new group and begin grouping the subsequent vertices together until we again find a row that is farther 
away than D m i n , and so forth. This is then repeated using the next smallest Distance as D m i n . This has a 
course-graining effect: as we use larger Distances for D m i n , farther vertices will start grouping together. 

Grouping the rows of M in this way is equivalent to grouping the vertices of the graph together into 
a sub-community hierarchy. This is also similar in form to many agglomerative techniques, with the row 
distances of M used as a similarity measure. These groupings can then be used to generate a dendrogram 
of the sub-community structure if we assume that each vertex is a singleton before we started grouping and 
that after the largest Distance is used, all vertices are grouped together. See Figures ITU1 1141 ITHl 1211 1241 and 
12 71 for such dendrograms. 

2.4 The impact of a 

The algorithm is based on a single parameter, a, which controls when to stop the spread of the /-shell. When 
a = 0, the /-shell will never stop until the entire connected component has been visited. As a increases in 
size, /-shells will tend to stop growing sooner, until eventually they do not spread beyond the starting vertex 
and the final result will be N singleton communities. This is guaranteed to happen when a > k max , where 
kmax is the largest degree in the network. 
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Figure 5: Membership matrix for Ideal graph 3 after 
sorting, with a = 0.25. Note the increased number 
of "spilled" vertices in the the middle rows, as com- 
pared with Figure [TBI 



Figure 6: Membership matrix for Ideal graph 3 af- 
ter sorting, with a = 3. The larger value of a 
stops the /-shells sooner, allowing for the smaller 
sub-communities to become evident instead of the 
main partitioning. 



The impact of varying a is readily apparent in Figures [S] and In Figure El the smaller a allowed the 
/-shells to spread farther: many /-shells starting from vertices close to the main partition (the two edges of 
highest betweenness) have spread to the entire network. In Figure El the larger value of a truncated the 
/-shells before they had a chance to spread beyond the sub-communities of the starting vertices. 

In contrasting Figures and El we emphasize that how one defines a communitiy through an algorithm 
bears on the specific results. It is our contention that there is not a single true answer for a community 
partition. We hold that the flexibility of a parameter like a to allow for various levels of community courseness 
is in fact quite natural: the result is in the eye of the beholder - of the specific algorithm. In any case, as 
can be seen by our examples in Section intermediate values of a lead to community partitions which 
agree well with many found in the literature, and we think that they make good sense in light of our model 
interpretation as described in the next section. 

One can think of a as a measure of the "friendliness" of the starting vertex, to use a social network 
analogy. When a is small (a -C 1), the /-shells will spread to much of the network. This can indicate 
vertices that are more likely to include other vertices in their respective communities or, in a social network, 
people who are more welcoming of their neighbors. Similarly when a is large (a 1), the /-shells will 
stop growing immediately. This can be indicative of vertices that are unlikely to include other vertices in 
their community, or hermit-like people who are unwilling to accept even their immediate neighbors into 
their communities, instead preferring to remain a singleton. In this sense, a can be thought of as an inverse 
measure of friendliness or social acceptance. 



3 Motivating Examples 

We propose the following idealized models for a network with simple community structure, and we test our 
algorithm first in these cases. Our models demonstrate the high-density intraconnections and low-density 
interconnections. We define an idealized community of size N as a complete subgraph (kj = (k) = N — 1), 
with one or more additional edges linking it to one or more additional ideal communities. 

These networks represent the extreme fulfillment of the idea of a community. Each community has the 
maximum number of internal links possible while having close to the minimum number of external links. 
This results in the number of emerging edges dropping off very sharply at the border of each subgraph, 



7 



leading to nearly identical results when a and the starting vertices are varied. 

Several configurations of these ideal networks are analyzed (Figures ^] and 1151 ). These networks 
also provide a means of visualizing, understanding, and interpreting how the membership matrix may look 
for a given community structure. In addition, these networks contain single vertices situated between the 
subgraphs. Since these vertices are equally connected to multiple communities, the results from Algorithm 
starting from these vertices will contain all the subgraphs that the vertices are linked to. This is evident 
in the rows of the membership matrix that overlap two or more blocks. 

Through these models, we can better understand how to interpret the performance of our algorithm; we 
believe these models in fact make suitable benchmarks for other community partitioning algorithms found 
in the literature. One can easily assemble a graph with a given community structure, apply a community 
partitioning algorithm to it, and compare the results of that algorithm with the structure created when the 
graph was assembled. 

In addition, it is useful to note that these networks require little or no sorting of their membership 
matrices. This is because the vertices are already numbered consecutively: vertices 1 through i are community 
1, vertices i+ 1 through j are community 2, etc. This is, of course, a contrived result and cannot be expected 
in general. 

4 Real- World Networks 

The proposed algorithm performs extremely well on idealized networks (see Section OJl , but how does it 
perform on real-world networks? Here we analyze the Zachary Karate Club, the network of co-appearances 
present in the novel Les Miserables by Victor Hugo, and the partisan network of co-purchased books on 
American politics. 

4.1 Zachary Karate Club 

The Zachary Karate Club is perhaps the most famous network in terms of community structure . The club 
suffered from infighting and eventually split in half, providing actual evidence of the community structure, 
at least at the top-most level. Thus, it provides an excellent means to compare the accuracy of any proposed 
detection methods. 

For a = 1.2, we achieve a reasonable result: three vertices (3, 14, and 20) are labeled incorrectly as 
compared with the betweenness partitioning 15: and the actual split the club underwent |19| . Looking at 
the network itself (Figure H5) l. all of the disputed vertices are situated on the "border" of one community or 
the other. One would expect these vertices to be the most likely to be labeled incorrectly because, unlike 
more idealized networks, these vertices are almost equally connected to both communities. 

Figures l4l 1201 and 1211 contain the membership matrix, cumulative row distances plot, and dendrogram, 
respectively. 

4.2 Books on Politics 

A network possessing a fairly simple two-community structure is the network of co-purchased books on 
American politics shown in Figure |22 20]. As can be seen from Figures l2~3l and l2"4l the results are extremely 
reasonable. 

4.3 Les Miserables 

Another network with an interesting community structure is the network of character co-appearances from 
the novel Les Miserables by Victor Hugo JB]- This network, shown in Figure [351 differs from the karate club 
and the political books networks in that there are several communities of smaller size rather than two large 
communities. As can be seen from the membership matrix in Figure EE! some of the communities separated 
quite well, while others were detected rather poorly, at least compared to the results in [TK] , 



8 



Figure 7: Ideal graph 1: Two complete subgraphs 
of size 15 bridged by a common vertex. 
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Figure 9: Cumulative row distances for Ideal graph 
1. Generated using the membership matrix shown 
in Figure |H| 



Figure 8: Membership matrix for Ideal graph 1. 
Note that no sorting was required, a = 1, but for 
these idealized model graphs, the results are identi- 
cal for a wide range of a values. 




Figure 10: Dendrogram for Ideal graph 1. Notice 
that central member vertex 16 is idealized here as 
a community unto itself, which in light of the form 
of the graph in Figure [7| simply means that 16 is 
central between two communities. Notice also the 
special placement of members 15 and 17. 
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Figure 11: Ideal graph 2: Three complete sub- Figure 12: Membership matrix for Ideal graph 2. 
graphs, one larger than the others. 
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Figure 14: Dendrogram for Ideal graph 2. Again, as 
Figure 13: Row Distances for Ideal graph 2. Gener- in FigureUHl notice the results for central members 
ated using the membership matrix shown in Figure 22, 43, 72, and also the boundary members 1, 21, 
E21 23, 42, 44, and 61. 
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Figure 15: Ideal graph 3: Three complete subgraphs 
joined together by multiple edges with singletons 
attached bridged by a single vertex to another group 
of similar (but not identical) structure. 
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Figure 17: Row Distances for Ideal graph 3, a — 
0.87. Generated using the membership matrix 
shown in Figure IT^l 
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Figure 16: Membership matrix for Ideal graph 3, 
a = 0.87. 





Figure 18: Dendrogram for Ideal graph 2, a = 0.87. 
Now with central members, for different scale com- 
munities, the dendrogram becomes more compli- 
cated in contrast to those in Figures UHl and ITU 
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Figure 19: The Zachary Karate club. The coloring 
indicates the membership of the two clusters of the 
topmost branch of the dendrogram fFigurel2*T]) 



Figure 20: Cumulative row distances for the 
Karate club, computed using the membership ma- 
trix shown in Figure|U(a = 1-2). 




Figure 22: The network of co-purchased books on Figure 23: Membership matrix for the Books on 

American politics |3U]. Here a link is drawn between Politics network, a = 1.2. 
two vertices if those books were purchased together 
from a major online retailer. 




Figure 24: Dendrogram for the Books on Politics network. Generated using the membership matrix shown 
in Figure [2*51 Note that vertices 35 and 37 are rows 34 and 35, respectively, of the membership matrix. 



5 Conclusions 

In this paper, we have introduced Algorithm ^ a new method for detecting community structure. This 
method is local and may be applied in situations where other methods are too inefficient; for example, when 
one is concerned with a single community and not the complete community structure of a graph. A single 
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Figure 25: The network of character co-appearances Figure 26: Membership matrix (after sorting) for 
from the novel Les Miserables by Victor Hugo. See the Les Mis network, a = 6.9. 
Figure |2"7I for partitioning. 




Figure 27: Dendrogram for the Les Mis social network. Generated using the membership matrix shown in 
Figure HO 



parameter, a, is used, making it very easy to tune the output of the algorithm, as was shown in Section [2.41 
We have also proposed one possible method of applying Algorithm globally (see Sections 12.21 and 
12.31) . Sorting the membership matrix is expensive and limits this global application to smaller networks. In 
addition, the membership matrix is, unlike an adjacency matrix, not necessarily sparse for a sparse graph: 
it will only be sparse when the sizes of the individual communities are all much smaller than the size of the 
network as a whole. This limits the possibility of replacing the membership matrix with a more efficient 
data structure. 

We feel that Algorithm ^ is a useful result, due to both its simplicity and flexibility. For example, our 
global application could be easily altered to become more efficient: one imagines the cost of the sorting 
algorithm can be offset a great deal by only starting Algorithm ^ from a fraction of the vertices, /, instead 
of all vertices in the graph. This reduces the cost of the sorting algorithm by a factor of / 3 , a real savings 
for small /, with the presumed tradeoff being a reduction in accuracy as / decreases. Other applications of 
Algorithm ^ besides the expensive use of the membership matrix may also be discovered. 

The Zachary Karate club has become an almost canonical representative of a community structure. The 
possibility remains, however, that outside factors may not be represented in the dataset. If this is true, then 
the club's fissure should not be used as the sole means of justification for a community detection method. 
For example, some of the border nodes, represented as diamonds in Figure ^ could well have joined either 
community, based solely on the network at hand. This can lead to ambiguity in the final partition. The 
point is that the algorithm defines the community; the community should not define the algorithm. 

Another concept, often neglected in determining communities, is the idea of a relative result. Who is 
to say that someone in a town agrees with what community the rest of the town feels he or she belongs 
to? It seems feasible that a vertex that is equally linked to two communities in a graph is just as likely to 
correspond to a person who thinks he or she is a member of both communities as it is to correspond to a 
person who feels they are independent of both larger communities. If one considers the output of Algorithm 
^to be what the starting vertex "believes" to be his or her community, then this method may prove to be a 
useful tool when analyzing relative results. This is not useful for many applications of community detection 
where a final structure is necessary, such as minimizing crosstalk between parallel processors in a computer, 
but it may prove very useful in areas such as social networks. 
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